Overview

Dataset statistics

Number of variables11
Number of observations5695
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory511.7 KiB
Average record size in memory92.0 B

Variable types

Numeric11

Alerts

gross_revenue is highly correlated with qtd_invoice and 4 other fieldsHigh correlation
recencydays is highly correlated with qtd_invoiceHigh correlation
qtd_invoice is highly correlated with gross_revenue and 4 other fieldsHigh correlation
qtd_items is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qtd_products is highly correlated with gross_revenue and 3 other fieldsHigh correlation
frequency is highly correlated with qtd_invoiceHigh correlation
qtd_return is highly correlated with qtd_invoiceHigh correlation
avg_basket_size is highly correlated with gross_revenue and 3 other fieldsHigh correlation
avg_unique_basket_size is highly correlated with gross_revenue and 2 other fieldsHigh correlation
gross_revenue is highly correlated with qtd_invoice and 1 other fieldsHigh correlation
qtd_invoice is highly correlated with gross_revenueHigh correlation
qtd_items is highly correlated with gross_revenue and 2 other fieldsHigh correlation
avg_ticket is highly correlated with qtd_items and 1 other fieldsHigh correlation
qtd_return is highly correlated with qtd_items and 1 other fieldsHigh correlation
avg_basket_size is highly correlated with avg_unique_basket_sizeHigh correlation
avg_unique_basket_size is highly correlated with avg_basket_sizeHigh correlation
gross_revenue is highly correlated with qtd_invoice and 3 other fieldsHigh correlation
qtd_invoice is highly correlated with gross_revenue and 1 other fieldsHigh correlation
qtd_items is highly correlated with gross_revenue and 1 other fieldsHigh correlation
qtd_products is highly correlated with gross_revenue and 1 other fieldsHigh correlation
frequency is highly correlated with qtd_invoiceHigh correlation
avg_basket_size is highly correlated with gross_revenue and 1 other fieldsHigh correlation
avg_unique_basket_size is highly correlated with qtd_productsHigh correlation
customerid is highly correlated with recencydaysHigh correlation
gross_revenue is highly correlated with qtd_invoice and 4 other fieldsHigh correlation
recencydays is highly correlated with customeridHigh correlation
qtd_invoice is highly correlated with gross_revenue and 1 other fieldsHigh correlation
qtd_items is highly correlated with gross_revenue and 2 other fieldsHigh correlation
qtd_products is highly correlated with gross_revenue and 1 other fieldsHigh correlation
avg_ticket is highly correlated with gross_revenue and 2 other fieldsHigh correlation
qtd_return is highly correlated with gross_revenue and 2 other fieldsHigh correlation
avg_basket_size is highly correlated with avg_unique_basket_sizeHigh correlation
avg_unique_basket_size is highly correlated with avg_basket_sizeHigh correlation
gross_revenue is highly skewed (γ1 = 21.33147068) Skewed
qtd_items is highly skewed (γ1 = 34.31160325) Skewed
avg_ticket is highly skewed (γ1 = 53.30430577) Skewed
qtd_return is highly skewed (γ1 = 51.46514451) Skewed
customerid has unique values Unique
qtd_return has 4143 (72.7%) zeros Zeros

Reproduction

Analysis started2022-09-21 09:26:13.887749
Analysis finished2022-09-21 09:27:06.349130
Duration52.46 seconds
Software versionpandas-profiling v3.1.0
Download configurationconfig.json

Variables

customerid
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct5695
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31232.13942
Minimum12346
Maximum83709
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size66.7 KiB
2022-09-21T06:27:06.894117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum12346
5-th percentile12699.1
Q114288.5
median16229
Q318210.5
95-th percentile82731.1
Maximum83709
Range71363
Interquartile range (IQR)3922

Descriptive statistics

Standard deviation28408.38395
Coefficient of variation (CV)0.909588151
Kurtosis-0.5185359982
Mean31232.13942
Median Absolute Deviation (MAD)1962
Skewness1.210180524
Sum177867034
Variance807036278.5
MonotonicityNot monotonic
2022-09-21T06:27:07.134136image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
178501
 
< 0.1%
163441
 
< 0.1%
129221
 
< 0.1%
820971
 
< 0.1%
165891
 
< 0.1%
137301
 
< 0.1%
168661
 
< 0.1%
820951
 
< 0.1%
820941
 
< 0.1%
820931
 
< 0.1%
Other values (5685)5685
99.8%
ValueCountFrequency (%)
123461
< 0.1%
123471
< 0.1%
123481
< 0.1%
123491
< 0.1%
123501
< 0.1%
123521
< 0.1%
123531
< 0.1%
123541
< 0.1%
123551
< 0.1%
123561
< 0.1%
ValueCountFrequency (%)
837091
< 0.1%
837081
< 0.1%
837071
< 0.1%
837061
< 0.1%
837051
< 0.1%
837041
< 0.1%
837001
< 0.1%
836991
< 0.1%
836961
< 0.1%
836951
< 0.1%

gross_revenue
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct5461
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1862.766266
Minimum0.42
Maximum280206.02
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:07.400115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0.42
5-th percentile13.635
Q1244.605
median639.89
Q31653.865
95-th percentile5522.757
Maximum280206.02
Range280205.6
Interquartile range (IQR)1409.26

Descriptive statistics

Standard deviation7963.897229
Coefficient of variation (CV)4.275306771
Kurtosis594.1773042
Mean1862.766266
Median Absolute Deviation (MAD)501.75
Skewness21.33147068
Sum10608453.88
Variance63423659.07
MonotonicityNot monotonic
2022-09-21T06:27:07.652114image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.959
 
0.2%
1.258
 
0.1%
4.958
 
0.1%
2.958
 
0.1%
12.757
 
0.1%
3.757
 
0.1%
1.657
 
0.1%
7.56
 
0.1%
5.956
 
0.1%
4.256
 
0.1%
Other values (5451)5623
98.7%
ValueCountFrequency (%)
0.421
 
< 0.1%
0.651
 
< 0.1%
0.791
 
< 0.1%
0.843
 
0.1%
0.853
 
0.1%
1.071
 
< 0.1%
1.258
0.1%
1.441
 
< 0.1%
1.657
0.1%
1.691
 
< 0.1%
ValueCountFrequency (%)
280206.021
< 0.1%
259657.31
< 0.1%
194550.791
< 0.1%
168472.51
< 0.1%
143825.061
< 0.1%
124914.531
< 0.1%
117379.631
< 0.1%
91062.381
< 0.1%
81024.841
< 0.1%
77183.61
< 0.1%

recencydays
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct304
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116.7311677
Minimum0
Maximum373
Zeros38
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:07.928134image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q122.5
median71
Q3199
95-th percentile337.3
Maximum373
Range373
Interquartile range (IQR)176.5

Descriptive statistics

Standard deviation111.5236412
Coefficient of variation (CV)0.955388723
Kurtosis-0.6387979375
Mean116.7311677
Median Absolute Deviation (MAD)61
Skewness0.8160444818
Sum664784
Variance12437.52255
MonotonicityNot monotonic
2022-09-21T06:27:08.179130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1110
 
1.9%
4105
 
1.8%
398
 
1.7%
292
 
1.6%
1086
 
1.5%
882
 
1.4%
980
 
1.4%
1779
 
1.4%
778
 
1.4%
2265
 
1.1%
Other values (294)4820
84.6%
ValueCountFrequency (%)
038
 
0.7%
1110
1.9%
292
1.6%
398
1.7%
4105
1.8%
552
0.9%
778
1.4%
882
1.4%
980
1.4%
1086
1.5%
ValueCountFrequency (%)
37323
0.4%
37222
0.4%
37117
0.3%
3694
 
0.1%
36813
0.2%
36716
0.3%
36615
0.3%
36519
0.3%
36411
0.2%
3627
 
0.1%

qtd_invoice
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct59
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.49165935
Minimum1
Maximum210
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:08.449117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile11.3
Maximum210
Range209
Interquartile range (IQR)3

Descriptive statistics

Standard deviation6.868852155
Coefficient of variation (CV)1.96721715
Kurtosis308.5962454
Mean3.49165935
Median Absolute Deviation (MAD)0
Skewness13.32902721
Sum19885
Variance47.18112993
MonotonicityNot monotonic
2022-09-21T06:27:08.687119image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12855
50.1%
2831
 
14.6%
3508
 
8.9%
4386
 
6.8%
5243
 
4.3%
6172
 
3.0%
7143
 
2.5%
898
 
1.7%
968
 
1.2%
1054
 
0.9%
Other values (49)337
 
5.9%
ValueCountFrequency (%)
12855
50.1%
2831
 
14.6%
3508
 
8.9%
4386
 
6.8%
5243
 
4.3%
6172
 
3.0%
7143
 
2.5%
898
 
1.7%
968
 
1.2%
1054
 
0.9%
ValueCountFrequency (%)
2101
< 0.1%
2011
< 0.1%
1241
< 0.1%
971
< 0.1%
931
< 0.1%
911
< 0.1%
861
< 0.1%
741
< 0.1%
631
< 0.1%
621
< 0.1%

qtd_items
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct817
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean267.35259
Minimum1
Maximum80996
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:08.928128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q142
median99
Q3210
95-th percentile650.3
Maximum80996
Range80995
Interquartile range (IQR)168

Descriptive statistics

Standard deviation1741.302866
Coefficient of variation (CV)6.513132585
Kurtosis1435.040611
Mean267.35259
Median Absolute Deviation (MAD)71
Skewness34.31160325
Sum1522573
Variance3032135.67
MonotonicityNot monotonic
2022-09-21T06:27:09.219119image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1236
 
4.1%
3116
 
2.0%
653
 
0.9%
2850
 
0.9%
1648
 
0.8%
2148
 
0.8%
6742
 
0.7%
1241
 
0.7%
5241
 
0.7%
3641
 
0.7%
Other values (807)4979
87.4%
ValueCountFrequency (%)
1236
4.1%
226
 
0.5%
3116
2.0%
425
 
0.4%
521
 
0.4%
653
 
0.9%
725
 
0.4%
819
 
0.3%
917
 
0.3%
1039
 
0.7%
ValueCountFrequency (%)
809961
< 0.1%
742151
< 0.1%
386391
< 0.1%
213521
< 0.1%
173761
< 0.1%
171501
< 0.1%
162881
< 0.1%
158531
< 0.1%
133691
< 0.1%
128721
< 0.1%

qtd_products
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct440
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.89341528
Minimum1
Maximum1787
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:09.927141image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q113
median36
Q385
95-th percentile242
Maximum1787
Range1786
Interquartile range (IQR)72

Descriptive statistics

Standard deviation101.8984273
Coefficient of variation (CV)1.457911691
Kurtosis43.73746456
Mean69.89341528
Median Absolute Deviation (MAD)28
Skewness4.695909111
Sum398043
Variance10383.28948
MonotonicityNot monotonic
2022-09-21T06:27:10.252175image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1264
 
4.6%
2156
 
2.7%
3115
 
2.0%
11100
 
1.8%
897
 
1.7%
597
 
1.7%
795
 
1.7%
1094
 
1.7%
491
 
1.6%
990
 
1.6%
Other values (430)4496
78.9%
ValueCountFrequency (%)
1264
4.6%
2156
2.7%
3115
2.0%
491
 
1.6%
597
 
1.7%
689
 
1.6%
795
 
1.7%
897
 
1.7%
990
 
1.6%
1094
 
1.7%
ValueCountFrequency (%)
17871
< 0.1%
17681
< 0.1%
13231
< 0.1%
11191
< 0.1%
11101
< 0.1%
8841
< 0.1%
8191
< 0.1%
7491
< 0.1%
7311
< 0.1%
7211
< 0.1%

avg_ticket
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct5507
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.61760545
Minimum0.42
Maximum77183.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:10.596117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0.42
5-th percentile3.465
Q18.510502308
median16.12
Q322.57508013
95-th percentile76.32
Maximum77183.6
Range77183.18
Interquartile range (IQR)14.06457782

Descriptive statistics

Standard deviation1281.098642
Coefficient of variation (CV)23.45578191
Kurtosis2955.510013
Mean54.61760545
Median Absolute Deviation (MAD)7.217
Skewness53.30430577
Sum311047.263
Variance1641213.73
MonotonicityNot monotonic
2022-09-21T06:27:10.955117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.7511
 
0.2%
4.9510
 
0.2%
1.259
 
0.2%
2.959
 
0.2%
7.958
 
0.1%
8.257
 
0.1%
12.757
 
0.1%
1.657
 
0.1%
4.156
 
0.1%
3.356
 
0.1%
Other values (5497)5615
98.6%
ValueCountFrequency (%)
0.422
< 0.1%
0.5351
 
< 0.1%
0.651
 
< 0.1%
0.791
 
< 0.1%
0.83714285711
 
< 0.1%
0.842
< 0.1%
0.853
0.1%
1.0022222221
 
< 0.1%
1.021
 
< 0.1%
1.038751
 
< 0.1%
ValueCountFrequency (%)
77183.61
< 0.1%
56157.51
< 0.1%
13305.51
< 0.1%
4453.431
< 0.1%
38611
< 0.1%
30961
< 0.1%
2027.861
< 0.1%
1687.21
< 0.1%
1377.0777781
< 0.1%
1001.21
< 0.1%

frequency
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct1225
Distinct (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5475706259
Minimum0.005449591281
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:11.284133image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0.005449591281
5-th percentile0.01102941176
Q10.02492211838
median1
Q31
95-th percentile1
Maximum17
Range16.99455041
Interquartile range (IQR)0.9750778816

Descriptive statistics

Standard deviation0.5505967909
Coefficient of variation (CV)1.005526529
Kurtosis138.7856997
Mean0.5475706259
Median Absolute Deviation (MAD)0
Skewness4.851371477
Sum3118.414715
Variance0.3031568261
MonotonicityNot monotonic
2022-09-21T06:27:11.592169image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12879
50.6%
248
 
0.8%
0.062518
 
0.3%
0.0277777777817
 
0.3%
0.0238095238116
 
0.3%
0.0909090909115
 
0.3%
0.0833333333315
 
0.3%
0.0294117647114
 
0.2%
0.0344827586214
 
0.2%
0.0769230769213
 
0.2%
Other values (1215)2646
46.5%
ValueCountFrequency (%)
0.0054495912811
 
< 0.1%
0.0054644808741
 
< 0.1%
0.0054794520551
 
< 0.1%
0.0054945054951
 
< 0.1%
0.0055865921792
< 0.1%
0.0056022408961
 
< 0.1%
0.0056179775282
< 0.1%
0.005665722381
 
< 0.1%
0.0056818181822
< 0.1%
0.0056980056983
0.1%
ValueCountFrequency (%)
171
 
< 0.1%
41
 
< 0.1%
35
 
0.1%
248
 
0.8%
1.1428571431
 
< 0.1%
12879
50.6%
0.751
 
< 0.1%
0.66666666673
 
0.1%
0.5508021391
 
< 0.1%
0.53351206431
 
< 0.1%

qtd_return
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct219
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.01720808
Minimum0
Maximum80995
Zeros4143
Zeros (%)72.7%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:11.915118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile40
Maximum80995
Range80995
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1475.325676
Coefficient of variation (CV)30.72493664
Kurtosis2713.85495
Mean48.01720808
Median Absolute Deviation (MAD)0
Skewness51.46514451
Sum273458
Variance2176585.85
MonotonicityNot monotonic
2022-09-21T06:27:12.162127image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
04143
72.7%
1190
 
3.3%
2156
 
2.7%
3107
 
1.9%
490
 
1.6%
672
 
1.3%
564
 
1.1%
1249
 
0.9%
849
 
0.9%
748
 
0.8%
Other values (209)727
 
12.8%
ValueCountFrequency (%)
04143
72.7%
1190
 
3.3%
2156
 
2.7%
3107
 
1.9%
490
 
1.6%
564
 
1.1%
672
 
1.3%
748
 
0.8%
849
 
0.9%
938
 
0.7%
ValueCountFrequency (%)
809951
< 0.1%
742151
< 0.1%
93611
< 0.1%
90141
< 0.1%
80601
< 0.1%
46271
< 0.1%
37681
< 0.1%
33351
< 0.1%
29751
< 0.1%
21601
< 0.1%

avg_basket_size
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2375
Distinct (%)41.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04377848893
Minimum1.347436502 × 10-5
Maximum1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:12.415128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1.347436502 × 10-5
5-th percentile0.001363210568
Q10.003448275862
median0.006622516556
Q30.0133742249
95-th percentile0.25
Maximum1
Range0.9999865256
Interquartile range (IQR)0.009925949037

Descriptive statistics

Standard deviation0.1516964889
Coefficient of variation (CV)3.465091934
Kurtosis28.77128418
Mean0.04377848893
Median Absolute Deviation (MAD)0.003846281132
Skewness5.278378743
Sum249.3184945
Variance0.02301182474
MonotonicityNot monotonic
2022-09-21T06:27:12.657130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1110
 
1.9%
0.571
 
1.2%
0.333333333353
 
0.9%
0.2550
 
0.9%
0.236
 
0.6%
0.166666666728
 
0.5%
0.0833333333326
 
0.5%
0.0121
 
0.4%
0.0136986301421
 
0.4%
0.00943396226420
 
0.4%
Other values (2365)5259
92.3%
ValueCountFrequency (%)
1.347436502 × 10-51
< 0.1%
2.469227255 × 10-51
< 0.1%
7.067637289 × 10-51
< 0.1%
7.165376899 × 10-51
< 0.1%
0.00012781186091
< 0.1%
0.00016640781011
< 0.1%
0.00016767270291
< 0.1%
0.00019238168531
< 0.1%
0.00023255813951
< 0.1%
0.00023364485981
< 0.1%
ValueCountFrequency (%)
1110
1.9%
0.66666666671
 
< 0.1%
0.571
1.2%
0.333333333353
0.9%
0.31
 
< 0.1%
0.2550
0.9%
0.236
 
0.6%
0.18751
 
< 0.1%
0.17647058821
 
< 0.1%
0.166666666728
 
0.5%

avg_unique_basket_size
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1282
Distinct (%)22.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1326961552
Minimum0.0008976660682
Maximum1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size89.0 KiB
2022-09-21T06:27:12.894116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0.0008976660682
5-th percentile0.005704545455
Q10.02777777778
median0.05555555556
Q30.1111111111
95-th percentile0.6383116883
Maximum1
Range0.9991023339
Interquartile range (IQR)0.08333333333

Descriptive statistics

Standard deviation0.2213233687
Coefficient of variation (CV)1.667895866
Kurtosis8.64729026
Mean0.1326961552
Median Absolute Deviation (MAD)0.03514739229
Skewness3.021774538
Sum755.7046039
Variance0.04898403353
MonotonicityNot monotonic
2022-09-21T06:27:13.150130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1271
 
4.8%
0.5162
 
2.8%
0.3333333333115
 
2.0%
0.07692307692103
 
1.8%
0.197
 
1.7%
0.111111111196
 
1.7%
0.2596
 
1.7%
0.295
 
1.7%
0.0714285714395
 
1.7%
0.0909090909194
 
1.7%
Other values (1272)4471
78.5%
ValueCountFrequency (%)
0.00089766606821
< 0.1%
0.0013351134851
< 0.1%
0.0013679890561
< 0.1%
0.0013869625521
< 0.1%
0.0014184397161
< 0.1%
0.0014556040761
< 0.1%
0.0014792899411
< 0.1%
0.0014814814811
< 0.1%
0.0015105740181
< 0.1%
0.0015337423311
< 0.1%
ValueCountFrequency (%)
1271
4.8%
0.83333333331
 
< 0.1%
0.81
 
< 0.1%
0.752
 
< 0.1%
0.66666666679
 
0.2%
0.64285714291
 
< 0.1%
0.63636363641
 
< 0.1%
0.64
 
0.1%
0.54545454551
 
< 0.1%
0.52631578951
 
< 0.1%

Interactions

2022-09-21T06:27:03.036121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:33.830119image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:36.716118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:39.710121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:42.635401image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:45.093118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:48.495118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:51.376165image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:54.544131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:57.405113image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:00.000118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:03.238113image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:34.108125image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:36.974163image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:39.955158image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:42.851430image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:45.335121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:48.815123image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:51.708116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:54.833131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:57.639128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:00.259117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:03.434134image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:34.352116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:37.241138image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:40.194116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:43.044427image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:45.907135image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:49.085129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:51.992159image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:55.070127image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:57.855118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:00.470121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:03.618132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:34.599117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:37.535123image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:40.421131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:43.273478image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:46.168120image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:49.326113image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:52.244131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:55.323116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:58.087129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:00.736139image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:03.824116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:34.867115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:37.784118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:40.659122image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:43.491131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:46.394115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:49.558116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:52.457160image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:55.536117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:58.293163image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:01.030121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:04.052159image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:35.163118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:38.091166image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:40.961121image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:43.723118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:46.663117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:49.804120image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:52.760114image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:55.853116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:58.540116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:01.328118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:04.277132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:35.439122image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:38.389167image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:41.321718image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:43.938131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:46.933119image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:50.039118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:53.029163image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:56.175130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:58.749117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:01.600117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:04.507160image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:35.698117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:38.715135image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:41.622252image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:44.170132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:47.250132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:50.286116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:53.292118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:56.468128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:58.984129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:01.889118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:04.722162image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:35.958120image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:38.978135image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:41.918250image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:44.389115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:47.550134image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:50.538139image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:53.534117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:56.726158image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:59.231136image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:02.159117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:04.956131image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:36.228130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:39.211128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:42.165261image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:44.653130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:47.894118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:50.841134image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:53.801132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:56.960130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:59.514136image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:02.403115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:05.166116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:36.470116image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:39.452130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:42.396394image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:44.869129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:48.194119image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:51.093117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:54.027117image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:57.180118image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:26:59.742129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2022-09-21T06:27:02.835161image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Correlations

2022-09-21T06:27:13.609129image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2022-09-21T06:27:13.950130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2022-09-21T06:27:14.283128image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2022-09-21T06:27:14.631132image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2022-09-21T06:27:05.479134image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
A simple visualization of nullity by column.
2022-09-21T06:27:05.896115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

customeridgross_revenuerecencydaysqtd_invoiceqtd_itemsqtd_productsavg_ticketfrequencyqtd_returnavg_basket_sizeavg_unique_basket_size
0178505391.2137234352118.15222217.00000040.00.0196190.114478
1130473237.54311013210618.8229070.02830236.00.0071890.058140
2125837281.38215156911529.4792710.04032351.00.0029640.060729
313748948.259551692433.8660710.0179210.00.0113900.178571
415100876.003333481292.0000000.07317122.00.0375001.000000
5152914668.3025155086245.3233010.04011529.00.0071330.145631
6146885630.8772157914817.2197860.057221399.00.0058000.064220
7178095411.9116129614688.7198360.03352042.00.0058340.196721
81531160767.90091216756725.5434640.243316474.00.0023830.038251
9160982005.638772403429.9347760.0243900.00.0114190.104478

Last rows

customeridgross_revenuerecencydaysqtd_invoiceqtd_itemsqtd_productsavg_ticketfrequencyqtd_returnavg_basket_sizeavg_unique_basket_size
5685837004839.42119175578.0551611.00.00.0009310.016129
568613298360.0011962180.0000001.00.00.0104170.500000
568714569227.3911701018.9491671.00.00.0126580.083333
56888370417.9011272.5571431.00.00.0714290.142857
5689837053.3511121.6750001.00.00.5000000.500000
5690837066637.591143063510.4528981.00.00.0005720.001575
5691837077689.230134773110.5187821.00.00.0004970.001368
5692837083217.20015245654.5288141.00.00.0015290.016949
5693837095664.890121121825.9857341.00.00.0013660.004587
569412713848.55011013822.3302631.00.00.0019690.026316